CDS

Accession Number TCMCG073C25421
gbkey CDS
Protein Id XP_010550977.1
Location join(632449..632547,632832..632930,633013..633078,633249..633326,633619..633781,634074..634318,634422..634779,635002..635073,635243..635514,635609..635665,635893..636180,636280..636458,636544..636826,637048..637224,637324..637485,637676..637753,637843..638124,638389..638553)
Gene LOC104821714
GeneID 104821714
Organism Tarenaya hassleriana
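
The Location field above uses the GenBank-style `join(start..end,...)` notation, where each pair is a 1-based, inclusive exon interval. As a minimal sketch (the helper name `parse_join` is hypothetical, and it assumes a plain forward-strand join with no `complement(...)` wrapper or partial markers such as `<` or `>`), the segments can be summed to check the coding length against the protein length reported in this record:

```python
import re

# Location string copied verbatim from the record above.
LOCATION = (
    "join(632449..632547,632832..632930,633013..633078,633249..633326,"
    "633619..633781,634074..634318,634422..634779,635002..635073,"
    "635243..635514,635609..635665,635893..636180,636280..636458,"
    "636544..636826,637048..637224,637324..637485,637676..637753,"
    "637843..638124,638389..638553)"
)


def parse_join(location):
    """Return (start, end) tuples for every 'start..end' pair in the string.

    Hypothetical helper: coordinates are treated as 1-based and inclusive,
    and strand or partial-feature markers are not interpreted.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


segments = parse_join(LOCATION)

# Inclusive coordinates, so each segment spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in segments)

# The final codon is the stop codon, so it does not count as a residue.
protein_length = cds_length // 3 - 1

print(len(segments), cds_length, protein_length)  # 18 3123 1040
```

The result, 3123 nt across 18 segments, corresponds to 1040 codons plus one stop codon, which agrees with the 1040aa length listed in the Protein section.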

Protein

Length 1040aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268022
db_source XM_010552675.1
Definition PREDICTED: protein ALWAYS EARLY 2-like isoform X1 [Tarenaya hassleriana]

EGGNOG-MAPPER Annotation

COG_category K
Description binding, transcription factor
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001
KEGG_ko ko:K21773
EC -
KEGG_Pathway ko04218, map04218
GOs -

Sequence

CDS:  
ATGGCACCGGGTAGGAAGTCAAGGAGTGTGAACAAACGTTTCTCTAATGAAACTTCTCCGGAAAAAGATGTCTGGAATTCAAGCAAAAGCAAGCAGAGGAAGAAGAAATTGTCAGACAAGTTGGGACCTCAGTGGACCAAAGGGGAGCTTGAGCGTTTCTATGATGCCTATCGGAAGCATGGGCAAGACTGGAAGAAGGTAGCTGCCTCAGTGCGTAATAACAGATCTGTGGAAATGGTGGAAGCCCTTTTCTATATGAATCGGGCGTACTTATCCCTGCCTGAGGGAACTGCATCTGTGGCTGGTCTCATTGCAATGATGACTGATCATTACAGTGTCATGGATGGGAGCGACAGCGAAGGAGAAGGGCATGCTGCTTCTGGAGTGCGAAAGAAATATCAGAAGGGCAAGCGTACGAAAGAATCTCGAGAGGAAACTAATATACCACATTCAATTGCATCGGCAGATGGATGCCTATCTTTTTTGAAGCAGGCACAGGCTTTTGGATCTCAGCGTCGTGCTACCGGAAAACGTACACCCCGATTTCCTGTTCCAAGTTCTTACAGGAGGGAGGATAGAGAAGGGTCCACTCCACTTAGTAAAAGAGCTAGGAAGCCAGTCGATAATGACGATGTTGCACATTTTGTAGCACTAGCATTAACAGATGCATCAAAAAGGGGAGGTTCTGAGTCACCATATAGAAGAACGGAGCACAGTGACAGCTCACCAATTAAGAGCTGGGGGAAAATGTCACAAGCAAGGGAAGCTCAATCCAAGCTCCGTGATAGTTCCATGTGTGAAGAGTGGGTGGAAGGTAACCGAGAAAGGAAACGTAAGAAGGGAGGCTCTGATAGAGATGCTACCTCTTTGATGGATATGGAGGGGGTTGGCACAGTGGAGGTTCTACGGAGAGGGAAAAAGTTCTACGGGAAAAAAGCAAAAGTCGAAGAAGCAAAAGGCATTGATTCTGATGACAGTGGAGAAGCATGCAGTGCCACTGAAGGACTTAGAAGTAAATCACAGAAAGGAAAAGTTGATGTTGAGGCCTCAAGGGGGAAATTTTCACCACATGGCCCAAAGAAAAGGAATAAGAAGCTTCAATTTGGGGATGAATTTGATGCTCTACAAGCATTGGCTGAATTGTCAGCTTCAATGCTTCCGGCGGCTCTGATGGAATCGGAATCATCCGCCCAGTTAAAAGAAGAAGGGACCGCAAATGACATGGATGAGAAGTCTAGCACTCCAGAAGCCACATCCACAAGCCATCATGGGGAGAAATCAAAACAAGTGGAACCAGAAGAAAGTGTCCTTCATGCGATTTCAGCTGTTGAGAATACTAAATATAGAAAACCGAAATCCTCGAGGCAAGCATCGGCGGATGGTATTGCTGGGCCTGCAGGGAAGCAACAGCAGCAACCTAGTGGCACATTGAGAAGAAGGCGTAAACCAAAGGTGCTAGATGCTGAACCTCCAACAGATTCAAACCAGAACAAATCCACATTGATGAAGGAATCAGCTCATGATGAGAATGCTAAGTCTGTGGTTAAAACAAAACGTACTGTTCAAGTTCCAGCACAGTCGAAACAGTTGAAAACTGTTAAGACATTGGAGGAATCTTCTTCAGCTAGCGATAAGAAAACTTTACCGGTTTCTGCTTCTGCCAGTTTACCGCAGAAACCTCAGAACAGACGCAAGATGGGCCTGAAGAAAACGTTACAAGAGAGGGTTAAATTTTCTGAGACCACTCCTAAAGCTTCATATGTCAATGAATCTCTTTTAGAACATGAATTATTGAAGGAGAAGCTTTCATCATGTCTATCATATCCCTTGGCACGTAGGAGGTGCATATTTGAATGGTTCTATAGTGCTATTGACTATCCATGGTTTGCAAAGATGGAGTTCGTTGATTACTTAAATCATGTTGGACTTGGCCATGTTCCCAGACTTACTCGTCTTGAATGGAGTGTCATCAAGAGTTCGCTTGGTAGACCTAGGAGATT
CTCTGAGAGATTCTTACAGGAGGAAAGGGATAAACTCAAACAATACCGTGAATCTGTGAGAAAGCATTACACAGAACTCCGGGCAGGTGCTAGGGAAGGACTTCCGACTGATTTGGCTCGGCCTTTATCGGTTGGGAATAAAGTTATTGCTATCCATCCTAAAACGCGAGAAATTCATGATGGGAAAATTCTCACTGTGGATCATAACAAGTGCAGCGTTCTGTTTGATCGTGATGACTTGGGGGTTGAGTTTGTTAAGGACATTGATTGCATGCCTTTGAATCCATTAGAGTACATGCCTGAAGGTCTGAGGAGGCAAATTGACATGTGCATGGCAATAAGCAAAGAAGCACATCTAAACAGACATCCAAATTTTGAAGGGTCTGTAATATTCCCTTCGAGTGTGCTTGAAAATGCTAGCTTATACATGAAACAGGGTGACACAAATGGACCGATTTTACAAGCTAAGATTTTGGCAACCAACACTACTAGTCCACAGCAGGCCACCAACAATCACCCTTTTATTACAACCTTTAGTAAAGCTAGAGAAGTTGAGATTCAACGAGCTCTGGCAGTGCAGCGTTCTCTAAATGAAAAGGAAATAGAGCCAGAAATGCTCGAAATTGTCAAGGGTTCAAAGTCAAGAGCTCAAGCAATGGTAGATGCAGCTATTAAGGCTGCATCTTCTGTGAAGGAAGGGGAAGATGCGAGTAAAAAGATCCAGGAGGCTATAGACTCCATAGGCAAACATCTGCCACTACACAGCTCTATGGTCCCTGGTGTCAAGCATCAAGAGCATGCCAATGGCAGCTTGGATCATCATCTCAGCCAGTCTCCCTCTGATGCATTAGAGTCACTGGTTAATGGTTCCATCTCACAGGACGGTTCAGGGAAAAACGAGGGGCAAATGCCTTCTGAGCTCATCTCCTCCTGCGTTGCCACTTGGCTCATGATTCAGATGTGCACGGAGAGACAGTATCCTCCAGCCGACGTGGCTCAGCTAATAGACACTGCAGTCATGAGCTTGCATCCGAGATGCCCTCAGAACCTGCCCATCTACCGAGAAATCCAGACGTGCATGGGTCGTATCAAGACTCAGATTCTTGCCCTTGTACCGACTTAA
Protein:  
MAPGRKSRSVNKRFSNETSPEKDVWNSSKSKQRKKKLSDKLGPQWTKGELERFYDAYRKHGQDWKKVAASVRNNRSVEMVEALFYMNRAYLSLPEGTASVAGLIAMMTDHYSVMDGSDSEGEGHAASGVRKKYQKGKRTKESREETNIPHSIASADGCLSFLKQAQAFGSQRRATGKRTPRFPVPSSYRREDREGSTPLSKRARKPVDNDDVAHFVALALTDASKRGGSESPYRRTEHSDSSPIKSWGKMSQAREAQSKLRDSSMCEEWVEGNRERKRKKGGSDRDATSLMDMEGVGTVEVLRRGKKFYGKKAKVEEAKGIDSDDSGEACSATEGLRSKSQKGKVDVEASRGKFSPHGPKKRNKKLQFGDEFDALQALAELSASMLPAALMESESSAQLKEEGTANDMDEKSSTPEATSTSHHGEKSKQVEPEESVLHAISAVENTKYRKPKSSRQASADGIAGPAGKQQQQPSGTLRRRRKPKVLDAEPPTDSNQNKSTLMKESAHDENAKSVVKTKRTVQVPAQSKQLKTVKTLEESSSASDKKTLPVSASASLPQKPQNRRKMGLKKTLQERVKFSETTPKASYVNESLLEHELLKEKLSSCLSYPLARRRCIFEWFYSAIDYPWFAKMEFVDYLNHVGLGHVPRLTRLEWSVIKSSLGRPRRFSERFLQEERDKLKQYRESVRKHYTELRAGAREGLPTDLARPLSVGNKVIAIHPKTREIHDGKILTVDHNKCSVLFDRDDLGVEFVKDIDCMPLNPLEYMPEGLRRQIDMCMAISKEAHLNRHPNFEGSVIFPSSVLENASLYMKQGDTNGPILQAKILATNTTSPQQATNNHPFITTFSKAREVEIQRALAVQRSLNEKEIEPEMLEIVKGSKSRAQAMVDAAIKAASSVKEGEDASKKIQEAIDSIGKHLPLHSSMVPGVKHQEHANGSLDHHLSQSPSDALESLVNGSISQDGSGKNEGQMPSELISSCVATWLMIQMCTERQYPPADVAQLIDTAVMSLHPRCPQNLPIYREIQTCMGRIKTQILALVPT
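
The relationship between the CDS and Protein sequences above can be spot-checked by translating the two ends of the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record itself does not state. The `translate` function is a hypothetical helper, and the two fragments are the first 18 and last 21 nucleotides of the CDS as printed above:

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; stop codons are rendered as '*'."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Fragments copied from the CDS in this record.
CDS_START = "ATGGCACCGGGTAGGAAG"     # first 18 nt
CDS_END = "CTTGCCCTTGTACCGACTTAA"    # last 21 nt, including the stop codon

print(translate(CDS_START))  # MAPGRK
print(translate(CDS_END))    # LALVPT*
```

Both fragments match the corresponding ends of the Protein sequence (`MAPGRK...` and `...LALVPT`), with the trailing `*` marking the TAA stop codon.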